Gilles Stoltz

LMO, CELESTE

Symphony of experts: orchestration with adversarial insights in reinforcement learning

Oct 25, 2023

Parameter-free projected gradient descent

May 31, 2023

Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness

May 25, 2023

On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits

Sep 30, 2022

Contextual Bandits with Knapsacks for a Conversion Model

Jun 01, 2022

A Unified Approach to Fair Online Learning via Blackwell Approachability

Jun 23, 2021

Diversity-Preserving K-Armed Bandits, Revisited

Oct 05, 2020

Adaptation to the Range in $K$-Armed Bandits

Jun 05, 2020

Hierarchical robust aggregation of sales forecasts at aggregated levels in e-commerce, based on exponential smoothing and Holt's linear trend method

Jun 05, 2020

Target Tracking for Contextual Bandits: Application to Demand Side Management

Jan 28, 2019